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Anoosheh Heidarzadeh and Alex Sprintson 


Abstract —Consider a set of clients in a broadcast network, 
each of which holds a subset of packets in the ground set X. In 
the (coded) cooperative data exchange problem, the clients need 
to recover all packets in X by exchanging coded packets over a 
lossless broadcast channel. Several previous works analyzed this 
problem under the assumption that each client initially holds 
a random subset of packets in A'. In this paper we consider a 
generalization of this problem for settings in which an unknown 
(but of a certain size) subset of clients are unreliable and their 
packet transmissions are subject to arbitrary erasures. For the 
special case of one unreliable client, we derive a closed-form 
expression for the minimum number of transmissions required 
for each reliable client to obtain all packets held by other 
reliable clients (with probability approaching 1 as the number 
of packets tends to infinity). Furthermore, for the cases with 
more than one unreliable client, we provide an approximation 
solution in which the number of transmissions per packet is 
within an arbitrarily small additive factor from the value of 
the optimal solution. 

I. Introduction 

Consider a network of clients that share a broadcast 
channel, each of which holds a subset of packets of the 
ground set X of size K. In the cooperative data exchange 
problem [1] (also known as universal recovery), each client 
wishes to recover all the packets in X. To achieve this 
goal, the clients exchange data by transmitting coded packets 
over a shared lossless broadcast channel. Assuming that each 
client knows which packets are known by all other clients, 
the problem is to specify how many and which (coded) 
packets each client requires to transmit so as to achieve the 
universal recovery. 

In this work, we consider a generalization of the co¬ 
operative data exchange problem, for the settings where a 
certain number of clients are unreliable and their packet 
transmissions are subject to arbitrary erasures. Specifically, 
our problem is to minimize the total number of transmissions 
required to achieve robust recovery, i.e., each reliable client 
can recover all packets held by the other reliable clients. 
Since the identity of the unreliable clients is unknown, 
the coding scheme must include redundant transmissions to 
tolerate a failure of a subset of clients of a certain size. 

This problem has several interesting practical applications. 
For instance, it captures the scenario where some clients, 
initially part of the network, leave the network (deliberately 
or not) before the end of the data exchange session. An¬ 
other example is the scenario where a subset of clients are 
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compromised by an adversary, and accordingly their packet 
transmissions can be dropped in an arbitrary manner. 

A. Related Work 

Recently, there has been a significant interest in the coop¬ 
erative data exchange problem, specifically due to the emer¬ 
gence of powerful techniques employing network coding [2], 
[3]. The cooperative data exchange problem was originally 
introduced in [4], where a broadcasting network was consid¬ 
ered, and was later generalized to arbitrary networks in [5]- 
[8]. Originally, lower and upper bounds on the minimum 
required number of transmissions were established in [9], and 
later, randomized and deterministic solutions to the problem 
were presented in [1], [10] and [11]. Scenarios considering 
various transmission costs have been studied in [12], [13], 
and scenarios providing secrecy and weak security, in the 
presence of an eavesdropper, have been considered in [14], 
[15] and [16], [17], respectively. 

To the best of our knowledge, the only “closed-form” 
solution to the cooperative data exchange problem (without 
unreliable clients) is given in [5], under the assumption of 
random packet distribution. This solution is shown to be 
correct with probability approaching 1 as the number of 
packets approaches infinity. Such a result, while asymptotic 
and exclusive to the random packet distribution, provides 
valuable theoretical insights as well as reasonable approxi¬ 
mation that can be used for constructing practical algorithms. 
However, this solution is limited to the settings in which 
all clients are reliable. This motivates the present work 
which attempts to bridge the gap and investigates closed- 
form (exact and approximate) solutions to the cooperative 
data exchange problem with unreliable clients, under the 
random packet distribution assumption. 

B. Our Contributions 

For the case with an arbitrary number of unreliable clients, 
we compute a closed-form approximate solution in which the 
total number of transmissions per packet (i.e., normalized by 
the number of packets K) is within an arbitrarily small (yet 
non-vanishing) additive factor of the optimal solution, with 
probability approaching 1 as K goes to infinity. 

Also, for the special case with one single unreliable client, 
we derive a closed-form exact solution which requires, with 
probability approaching 1 as AT goes to infinity, the minimum 
total number of transmissions. The exact solution yields a 
zero additive optimality gap (to the minimum total number 
of transmissions) and hence is stronger than our approximate 
solution which yields a nonzero gap that grows linearly 
with K. The strength of this result, however, comes with its 


restriction to a special case, and its generalization to settings 
with more than one unreliable client remains open. 

II. Problem Setup and Definitions 

Consider N clients and the set X of K packets 
Xi, X 2 , ■ ■ ■, Xk- We use the short notation [n] to represent 
the set {1,... ,n}, for any integer n. Each client i £ [AT] 
holds a subset Xi of the packets in the set X (without loss 
of generality, we assume X = U i <,;< ; y X,). We also denote 
by Xi = X \ Xi the set of packets missing at the client 
i. We further assume that each packet is available at each 
client, independently from other packets and clients, with 
probability (w.p.) a. (This assumption is referred to as the 
random packet distribution in [5].) 

We assume that M (0 < M < N) clients are “unreliable” 
and that the identity of unreliable clients is unknown. Each 
reliable client broadcasts over a lossless channel, while the 
packets broadcasted by each unreliable client are subject to 
arbitrary erasures. The goal of the cooperative data exchange 
in this setting is to achieve robust recovery that guarantees 
that each reliable client can recover all the packets known 
by the other reliable clients. We use the notion of robust 
recovery since universal recovery might not be achievable in 
our setting due to the fact that that it might not be possible 
to obtain a packet held by unreliable clients only. It is worth 
noting that for the special case with no unreliable clients 
(M = 0), the robust recovery problem becomes equivalent 
to the universal recovery problem. 

We further assume that each packet Xi is P-divisible, 
where P = N — M — 1, i.e., Xi can be partitioned into 
P chunks of equal size (the reason for this choice of P will 
become clear later), and transmissions can consist of a single 
chunk (as opposed to an entire packet). 

Let {ri} = {fi}({Xj}), 1 < i < N be the transmission 
schedule for a given instance {Xi} of the problem at hand. 
(By the P-divisibility assumption, it follows that for each 
client i, the number of its transmissions ri is a rational 
number of the form p, for some non-negative integer n). 
The transmission schedule {ri} is said to be feasible for 
instance {Xi} if there exists a coding scheme with each 
client i transmitting r, coded packets that achieves robust 
recovery. The following definitions assume that {Xi} is 
drawn according to the random packet distribution. 

Definition 1: The transmission schedule fry} is said to 
be feasible if it is feasible for a random instance { X ,} 
w.p. approaching 1 as K —> oo. 

Definition 2: The transmission schedule {r,} is said to be 
an exact solution if it is feasible and ri is equal to the 
minimum total number of transmissions required for robust 
recovery. 

Definition 3: The transmission schedule fry} is said to 
be an approximate solution if it is feasible and -A ri 
is within e of the ratio of the minimum total number of 
transmissions required for robust recovery to K, for any 
e > 0, w.p. approaching 1 as K —>• oo. 

In this work, our problem is to determine a closed-form 
exact or approximate solution for a random instance of 


the robust recovery problem. Given a feasible transmission 
schedule, the clients can achieve robust recovery (with high 
probability) by employing random linear network coding 
(over a sufficiently large finite field), i.e., transmitting ran¬ 
dom linear combinations of their packets. (This comes from 
the fact that the problem of robust recovery, similar to the 
problem of universal recovery [5]—[7], can be reduced to a 
multicast network coding problem.) 


III. Main Results 


For the special case with no unreliable client (M = 0), 
it was previously shown in [5] that the optimal number of 
transmissions for each client can be found by solving the 
following Linear Program (LP): 
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minimize 


5 >, 
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s.t. ^2 r i> P) Xi 

ieM i£jf 


, V0C7VC [N], 


where 77 = [IV] \ J\f. 

Now, consider the case with M unreliable clients. Since 
the set of unreliable clients (X) is not known apriori, the 
robust recovery is achievable so long as for every I C [N], 
\X\= M, each client i £ [A"] \ I can recover all the packets 
held by the other clients j £ [A'j \ X. In the case without 
unreliable clients, the set of packets each client i requires, 
Xi, is the collection of the packets available at the other 
clients (but not available at client i), i.e.. 


X, = X \ 


( 2 ) 


However, in the presence of unreliable clients, the set of 
packets each reliable client i ^ X requires, denoted by X ? j, 
is the collection of the packets each of which is available at 
some other reliable client (but not available at client i), i.e., 

Xi,x = U je[N]\xXj \ Xi. (3) 

Thus, we need to revise the set of constraints in 03 so as to 
take into account (i) every possible set of unreliable clients 
and (ii) the set of packets each reliable client requires for any 
possible subset of unreliable clients. The following theorem 
is a straightforward generalization of LP £[]) for the case with 
M unreliable clients, and appears without proof. 

Theorem 1 (Robust Recovery): The minimum total num¬ 
ber of transmissions required for robust recovery is the 


(4) 

P) Xi t z , 

i€ AT\X 

AT| <P}, 

where N is the number of clients, M is the number of 
unreliable clients. 


optimal value of the following LP: 
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Our goal is to solve LP (jTJi. It is noteworthy that LP ([T]). 
which is a special case of LP (0]l when M = 0, was 
previously given a closed-form exact solution in [5]: 

Theorem 2: [5, Theorem 4] { r t } is an exact solution to 

LP <H>: 

N 1 — — 

jy _ -j \ X t\ ~ | X i | J ■ ^ • 

3 = 1 


The following summarizes (into three steps) the technique 
which was previously used to solve LP (jT]i: 

(i) Choose the set of TV constraints in (QJ corresponding to 
the TV subsets {TV C [TV]: |TV| = TV — 1}. 

(ii) Solve the system of TV linear equations corresponding 
to the TV constraints of step (i) for the TV unknowns {r, } 
(where the inequalities are replaced with equality). 

(iii) Show the feasibility and optimality of the solution of 
step (ii) with respect to the rest of the constraints in ([]}. 

Now, a natural question is whether we can use such a 
deceptively simple, yet remarkably powerful, technique to 
solve LP (|4]l. The answer is positive, yet as we will show later 
the two steps (i) and (iii) require a significant amount of non¬ 
trivial modifications to become applicable to our problem. 
The complication comes from the fact that in our case, as 
opposed to the case with no unreliable clients, a “proper” 
choice of constraints in the step (i), yielding a solution in the 
step (ii) which is satisfactory with respect to the requirements 
in the step (iii), is not obvious. Also, it is not clear whether 
in our case such a proper choice of the constraints always 
exists. The following theorems summarize our main results. 
For the ease of exposition, we define 


kj = max 




1 <j<N-M, (5) 


Theorem 4 (Exact Solution): Assume that the clients are 
re-labeled such that kjj > kjj, 1 < i < j < TV@ Let Q > 1 
and 0 < R < 1 be some integers such that TV = 2(Q+1)—R. 
Then, {fy} is an exact solution to LP (QJ when M = 1: 

- f r - ki tN - ki !N -Q, l<i<N-Q , 

* \ f — k N - Q , N - k hu N - Q < i < N, {LV> 

where 
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Moreover, 
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p kl ’i + — -R- k l,N-Q- 


( 12 ) 


i—N—Q+1 

Proof: The proof is given in Section IIV-BI □ 


IV. Proofs 

In this section, we give the proofs of theorems [3] and 0] 
Before giving the proofs, for the ease of exposition we state 
a few definitions. Consider a generic LP as follows: 


where the maximization is over all i € AT and all {TV C 
[N] : |7V|= P, {n}i< n <j S TV}, and 

h,i = \ x i,{o} 1 , 1 < i 7 ^ 3 < N - ( 6 ) 

Theorem 3 (Approximate Solution): Assume that 

the clients are re-labeled such that k t > kj, 
1 < i < j < N — M\X\ Let Q > 1 and 0 < R < M be some 
integers such that N = ( M + 1)(Q + 1) — R. Then, {fi} is 
an approximate solution to LP (f4j: 


where 


( r-ki, 1 < i < Q, 
\ r - kQ+ 1 , Q < i < N, 



P 


— Q +1 

~ 


k Q+ 1 - 


(7) 

( 8 ) 


Moreover, 



N + Q(P-N) ^ 


Proof: The proof is given in Section IIV-AI 


k Q+ 1 - 

(9) 

□ 


According to (5), it is easy to see that such a re-labeling always exists. 


N 

minimize E n, 03) 

2=1 

s.t. ^ n > fdXij-Af), V0 C TV C [TV], 
ie M 

where f({Xi}\Af) is an arbitrary function of { X ,} and TV. 
The following definitions are with respect to LP (fl3l) : 

Definition 4: A sequence {r^} is said to be feasible if it 
satisfies the constraints for a random instance { X t } w.p. ap¬ 
proaching 1 as K —> oo. 

Definition 5: A sequence {ri} is said to be optimal if 

i ri is equal to the optimal value. Moreover, {ri} is said 
to be a solution if it is feasible and optimal. 

Definition 6: A sequence { r ,} is said to be near-optimal 
if I s within e of the optimal value normalized by 

K, for any e > 0, w.p. approaching 1 as K —> oo. Moreover, 
{ri} is said to be an approximate solution if it is feasible 
and near-optimal. 

2 The re-labeling procedure is as follows: for each n, starting from 1 and 
ending at TV — 1. switch the labels of clients n and n + 1 if and only if 
k n ,n +1 < kn+ i n- The proof is straightforward and follows from the fact 
that for any if kij > kjj and kjj > /;•/ ,. then k,j > kj,. 















A. Proof of Theorem [3] 

Consider a reduced version of LP 0 as follows: 


minimize 

s.t. 
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i,AT\{i} 


(14) 

V{7V : \Af\ = P}. 


(From now on, we adopt the notation A f to represent an 
arbitrary subset of [ N ], unless otherwise stated.) 

For arbitrary M, no closed-form solution to LP (fl4t is 
known. (However, we will give a closed-form solution to 
LP (l~m > later for the case of M = 1.) We, instead, give a 
closed-form solution to LP ( fT5]> . which we will construct by 
over-constraining LP (fl4l> . Next, we show that our solution 
to LP o is an approximate solution to LP and 

subsequently, LP (0), which was to be solved ultimately. 

We construct LP <U3 by replacing max- 6 ^|X-^ {i} | 
with kj (given by (0) in LP ( 1141 : 


N 

minimize E ru (15) 

2=1 

s.t. n > kj, VI < j < N — M, 

i&M 

V{AC C [N] \ {j} : |W|= P, {*}!<*<,• £ Af}. 

(By ©, kj > max- gXr |X.^ Ui} |, for all {Af : |A/] = 
P■ {*}l <i<j £ A/”}.) 

Lemma 1: {A} is an exact solution to LP (IT5l) . 

Proof: We prove the feasibility and the optimality of 
{fi} to LP (fl5l > in lemmas [3] and 01 respectively. □ 

The following lemma is useful in the proof of Lemma [3] 
Lemma 2: f\ < T 2 < ... < tq < Fq+i = • • • = fjv- 
Proof: By the assumption, 


ki > k2 > ■ ■ ■ > k^-M- 


(16) 


By definition, r, = fj+i, Q < i < N. Thus, it remains to 
show fi — fi +1 < 0, 1 < i < Q. By combining 0 and 0 
along with (fl6l . fi — fj+ 1 = k l+ i — ki < 0, 1 < i < Q. □ 
Lemma 3: {fi} is feasible with respect to LP (fl5l> . 

Proof: To prove the lemma it suffices (and we verify the 
sufficiency shortly) to show that {fi} meets the inequalities: 

Q 

'52r i -r j +{P-Q + l)r Q+1 > kj, VI < j < Q, (17) 

i=l 

and 

Q 

J2ri + (P- Q)r Q+ 1 > kj, VQ < j < N — M. (18) 

i= 1 

For every 1 < j < Q, the left-hand side (LHS) of the cor¬ 
responding inequality in ( 1171 ) is the smallest in comparison 
with that of the rest of the corresponding inequalities in (Q0. 
This comes from the fact that in comparison with the LHS 
of the inequalities in (fl5l >. the LHS of the inequalities in 
has the minimum number (P — Q + 1) of the (larger) 
terms fi, i > Q, or equivalently, the maximum number 


(Q — 1) of the (smaller) terms fi, i < Q (by Lemma [2] 
f\ < f 2 <■■■ < fq < f Q+ 1 = ■ ■ ■ = f N ). Thus if 
the inequalities in ([T71 > hold true for {fi}, then the rest of 
the inequalities in (Q0 obviously hold true. The LHS of the 
inequalities in (fT8l > are identical for every Q < j < N — M, 
and every such inequality holds true so long as the inequality 
with the largest right-hand side (RHS) (Lq. |_i) holds true. 
(By ( IT6l ). kQ +1 > ... > /cjv-M.) Thus, we can replace all 
the inequalities in ([!¥] > with one inequality: 

Q 

T,n + (P- Q) r Q+ 1 - k Q+ 1 - 

i= 1 

The rest of the proof is straightforward (and hence not 
included due to the lack of space) by showing that {r,} 
satisfies all the inequalities in (fI7] > and ( 1 1 9[ i with equality. □ 
Lemma 4: {A} is optimal with respect to LP (fl~5l) . 

Proof: Consider the dual of LP ( 1 13 l i: 

maximize EE kjSjif, (20) 

3 A3 

- EE SA3l{ieA3} <1, vi < i < N 

3 A3 

V{AA c [A3] \ {j} : |A r\= P, {*}!<,<,- G A f}, 
VI < j < N - M, 

(sa3 > 0). 

(We notice that A F depends on j, yet we use the same 
notation M, instead of A fj, for the ease of exposition.) 

We show that the duality gap with regards to LP (Q0 
and LP © is zero. To be more specific, we prove, by 
construction, there always exists a set S of A3 subsets A f 
such that {sat} is feasible to LP ( |20| ) so long as s_\: = -p, for 
every A(G S, and sjj = 0, for every A/ ^ S. By the structure 
of our construction process, {i} (1 < i < N ) belongs to P 
subsets Af G §. Thus, 

EE SA3l{ iG A3} = 1, VI < * < A3. 

3 A3 

(Every inequality in <[20l holds with equality.) Moreover, our 
choice of § has a partition {S^,... ,S^ +1 )} such that (i) 
|§C?)|= M + 1, 1 < j < Q, and |§(<3+ 1 )|= M - R + 1, 
and (ii) {j} Af and {i}i<i<j £ Af, for every Af G 
1 < j < Q + 1. By (i) and (ii), it is obvious that 

2^ fc 2 s A3 = 2^ —— k 3 +- p - k Q+ 1 . (21) 

3 A3 J=1 

By comparing 0 and (OH . it follows that the optimal values 
of the (primal) LP (00 and (its dual) LP ([20l i are equal: 

N 

E r ~3 = EEMA3, (22) 

2=1 j M 

since 2^2. = N ~ p anc ) m-r+i _ Q(n-p)-n . -pjj US ^ 
the duality principle. ( l22t proves the optimality of {fj}. 

The rest of the proof proceeds by the construction of set 
§ with properties (i) and (ii), defined earlier. Let §V), 1 < 










j < Q, be the set of all subsets {A/ - : {i}i<i<j 
A/", {i}j<i<Q £ A/”}, and §^ +1 ) be the set of all subsets 
{A f : {i}i<i<Q £ A f}. For every 1 < j < Q and every 
0 < m < M, or j = Q + 1 and every 0 < m < M — R, 
construct the (auxiliary) set x (in a recursive manner): 

S^+i = U AT, 

for arbitrary TV £ SV\ {Q < i k < N} 1 < k < P _ Q+1 £ TV, 

such that 


(S&>) < .. < n ip _ Q+1 (& <-.< n iN _ Q (§W), 


where nj(Sm ) is the number of subsets {TV : TV £ 

{*} £ TV}; S ( 0 j) = 1 < j < Q + 1, and §< 1} = 0. 

(Such a subset TV always exists since there exists a unique 
TV £ §(V such that {i} ig x £ TV, for every I C [AT] \ [Q] and 
\I\= P — Q + 1.) Now, we can construct partitions (S^)}; 




1 <3<Q, 

§M-R+ 1-S^, J = Q + 1. 


(23) 


By construction, both properties (i) and (ii) hold so long as P 
and only P subsets TV £ S exist such that { i } £ TV (for every 
i). By the definition of §, obviously n, (§) = P, for every 
1 < i < Q. Thus, it suffices to show rii(S) = P, for every 
Q < i < TV. Let be m when m subsets are chosen. 
By the structure of the construction, it is not hard to see 
at each step m of the selection of one new subset for every 
Q < i < N, either n, increases by one or it does not change. 
(Thus, 0 < n( n ' > — < 1.) Since at the beginning of 

the process (when no subset is chosen) nf^ = 0, for every 
i, n[ m \ for every 1 < m < N, is either norn-1, for some 
1 < n < m (depending on N and M). It suffices to show 
that n|' V ' ) = P, for every Q < i < N. Let 


I(m,n) = \{Q < i < N : n[ m ' > = n}\. 


(Thus, \{Q < i < N : n^ = n— 1}}|= N — Q — I(m , n).) 
It is easy to see that = P so long as /(TV, P) = N — Q 
(i.e., N — Q — /(TV, P) = 0). By analyzing the construction 
process step by step, it can be shown that 


I(m, n) = m(P — Q + 1) - (n - 1)(AT — Q) + £(m), 


where 

£(m) 


0, l<m<{M + l)Q, 

(M + 1)Q — to, (M + 1)Q < m < N. 


It is easy to verify /(TV, P) = TV — Q, and thus, for every 
Q < i < TV, n[ NS> = P. This completes the proof. □ 

The following result follows from the random packet 
distribution assumption (by the application of the law of large 
numbers), and is useful in the proof of Lemma [6] 

Lemma 5: For every TV C [TV], |TV|= V (0 < V < TV — 
TV/), and 1 C TV, |X|= M (0 < M < TV), for any e > 0, 
w.p. approaching 1 as K —>• oo, we have 
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(1 - a)~ N - 1 
Proof: By (0, it can be easily shown that 

Xi,x = Xi \ (UjElXj \ Uj Gp Xj), 

and further. 
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where I = [TV] \ I. For every x n £ X, 

-I 


Pr ^ x n £ P| Xi} = 
ie77\x 


and 
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(1 — a)~ M — 1 
(1 — a)~ N — 1 ’ 


The followings hold true for any e > 0, w.p. approaching 1 
as K —> oo. By the law of large numbers, 
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Thus, combining (f25l> and (f26t together with 
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< e, 


w.p. approaching 1 as K —> oo. □ 

Lemma 6: {A} is an approximate solution to LP {14} and 
LP ©. 

Proof: Since the objective functions in LP ( 1 1 41 ) and 
LP © are identical, and the constraints in LP (ITdl) are a 
subset of the constraints in LP ©, the following observations 
are straightforward: (i) if {r,} is feasible to LP ©, then it 
is feasible to LP (fl4l ). and (ii) if {r,} is near-optimal to 
LP ( 1 1 4l i. then it is near-optimal to LP ©. Thus, it suffices to 
show {fi} is feasible to LP © (Lemma 0 and near-optimal 
to LP {]© (Lemma©. □ 

Lemma 7: {A} is feasible with respect to LP ©. 

Proof: The feasibility follows immediately so long as 
{fi} meets the inequalities: 
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By Lemma |2] 


E^E*- V{AT : |A/] = V}. 

i£ A/" 2—1 


Thus, it suffices to show 
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, VI < V < P. 


The following is true (by the result of Lemma |2 w.p. ap¬ 
proaching 1 as K —> oo. For any e > 0, 
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Thus, we need to show 
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w.p. approaching 1 as K oo. We consider two cases: (i) 
V > Q and (ii) V < Q. In Case (i), 

v Q 


5 > = 5 >(^ 

i =i j=i v 

and in Case (ii), 
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2 =1 3 = 1 

+ ^Q +1 


fc Q+l 


F + Q(P-F) 
P 


(28) 


V-P 

P 


Q 

■ E k i 

3=V +1 


(29) 


C + X(P-Q) 
P 


Let Cj, 1 < j < Q, be equal to e in Case (i) and Case (ii), 
9iErYl a rw P Z^ e in Case (i) 


and £q + i be equal to Q(p _ V j + 
or Case (ii), respectively. By the result of Lemma [5] 


Q(P-V) + V e 01 P-Q +1 


kj 

-%■ > ZM,P ~ Cj, 


(30) 


w.p. approaching 1, when K —» oo. By combining 
or < [29t together with (l30ll . we get 


1 V- ~ ^ ~ 

x2^ ri> p Z M,P + e, 


(31) 


2 =1 


in Case (i) or (ii), respectively. By comparing (l27l > and ( 13 1 L 
one can see we need to show 


vi <v<p 

-T AM,P 


(32) 


w.p. approaching 1 as K —> oo (for every 0 < a < 1). By 
substituting Zm,v and Zm,p into <E2, we get 


where 


V?(a) = 


V , , 

P > vw, 


(1 - a) p ~ v - (1 - a) 1 
1 — (1 — a) p 


(33) 


It is easy to see <^(1) = 0 and (p(a) —>• p as a —> 0. By 
definition, jj < 1. Thus, d33l > holds so long as ip'(a) < 0 , 
for every 0 < a < 1 , where <p'(a) is the derivative of the 
function ip(a) with respect to a (i.e., ip(a) is decreasing, 
bounded from above by p). It is easy to see <p'(a) < 0 so 
long as 


1 1 ^ (l-a) v (1 — a) p 

V ~ P > V ~ P 


(34) 


Since 

1 (1 -a) n 

~ > -—, 

n n 

for every n > 0 and every 0 < a < 1, ( 134} holds so long as 


1 1 (1 - a) n (1 - a) n+1 

n n + 1 n n + 1 

or equivalently, 


1 — (1 — a) n ^ n 
1 - (1 - a) n+1 > n+ 1 ’ 


(35) 


for every n > 0 and every 0 < a < 1. Let 

, 1 - (1 - a) n 

7(a) 1 — (1 — a ) n+1 ' 

Since 7 ( 1 ) = 1 and 7 (a) —> 777 , as a —> 0, (135} holds 
so long as ^’(a) > 0 , for every a (i.e., 7 (a) is increasing, 
bounded from below by 777 ). It is easy to see 7 '(a) > 0 
so long as X!m=i(l — a ) m < n ’ which obviously holds true 
for every 0 < a < 1 . □ 

Lemma 8: {77} is near-optimal with respect to LP ( 1141 . 
Proof: The dual of LP ([14} can be written as: 


maximize 

s.t. 


E 


max 

2 GAT 


X 




Sjif, 


E s x1{2 G aO <1, VI < i<N 

N 

V{W C [X] : \Af\= P}, 


{sat > 0 ). 


(36) 


Let r* K be the optimal value of LP (ITdl) . By the definition of 
the near-optimality, we require to show 


1 

K 


N 1 
^ h < —r* K 




K 


(37) 


for any e > 0. To do so, we use the set § = 

{§*4),... ,§( < 3 + 1 i} which we previously constructed in the 
proof of Lemma [2 and set .77 = j-,. for every AT £ S, and 
.77 = 0, for every J\f f S. Since LP (l36t and LP ( f20l i have 
identical constraints, {. 77 }, which was shown to be feasible 
with respect to LP d20l >. is feasible with respect to LP d36l >. 
Thus, by the duality principle. 


r 


* > 
K Z 


E 


max 

ieX 


X 


2 ,Ar\{ 2 } 


SM 


1 

p 


Q +1 

E E 


i=i jVesW) 


max 

2 GAT 




( 38 ) 































The following results hold w.p. approaching 1 as K —» oo 
(by the results of Lemma [3. For any { 6 j > 0 } and every 

N e 


l 

— max 

k iG jr 


x 


i,AT\{i} 


>Zm,p VI <i < 0 + 1. (39) 


Since |§W|= M + 1, 1 < j < Q, and |§(<2 +1 )|= M-f?+l, 
combining (l38l > and (l39l > we can write 

Q 


-E 

k2 j 


max 
i eJ7 


X i,SS\{i} 


IV r7 V—' 

sj\r > -pZ M , P — } J 

0 =1 


M + 1 


M — R + 1\ €q+i 


Similarly, 


|< z 


m,p + 7^, VI < j < 0 + 1- 


(40) 


(41) 


By combining (1411 together with <(+]), we get 

JV Ar Q 


1 

K 


\ ' ~ -/V ^7 v—> 

n < + 


p 


+ 


AV-P 


J=1 

AV + Q(P-AV)\ eQ+1 


(42) 


By combining (l42t and (|4(Tb together with (|38l >, one can 
see 03 holds so long as 

Q 


e >E 

f=i 


M + l 
P 


e-i 


M-P+l 


e Q+i- (43) 


The RHS of (1431) can be made arbitrarily close to 0, and this 
completes the proof. □ 

B. Proof of Theorem [4] 

LP (IT4l) can be rewritten as (when M = 1 ): 


N 


minimize 


E ri > 

i=l 


(44) 


s.t. ^ Vi > k m ,n , VI < m < n < N, 

iePm.n 

where k m ,n is given by ©, and Af rn , n = [AV] \ {m, n}. 

The following two lemmas are useful in the proof of the 
theorem. (The proofs are straightforward and hence omitted). 
Lemma 9: For every 1 < mi, m 2 , rti, ri 2 < AV, 

krni,ni ^mi,n 2 — km 2 ,ni (m 2 ,n 2 ! (45) 

so long as {mi, m 2 } D {ni, 712 } = 0- 

Lemma 10: For every 1 < n < m\ < m 2 < N, 

km\,n km, 2 , ti — A+m i A) n>m2 . (46) 

For every 1 < m < n < N, let A mr , be defined as 

k m ,N — fcl.JV + A+jV-Q 

-kN-Q,N + A+iv, m < n < N - Q, 
Am.n = * k m ,N A+jV + A+n, 777 + AV Q + 71, 
A+rn — A+jV + Ln~Q,N 

—ki,N-Q + ki, n , N ~ Q <m <n. 

( 47 ) 


By applying the results of lemmas [9] and [TO] the following 
result can then be shown. (The proof is omitted due to the 
lack of space.) 

Lemma 11: For every 1 < m < n < N, 


We now construct LP (l48l > (by over-constraining LP ( 1441 ) ): 

N 

minimize E r i} (48) 

1=1 

s.t. ^ n > X m ,n, VI < m < n < N. 
i 6 jV m ,„ 

(By the result of Lemma fTTl it is easy to see the constraints 
in (l48l) are stronger than those in (144] ).') 

We now prove the theorem in two steps: (i) we show {fj} 
gives an exact solution to LP (l48l) . and (ii) we show {fi} is 
an exact solution to LP ( ITTl ) and LP £1]). 

Lemma 12: {fj} is an exact solution to LP (1481) . 

Proof: We prove the feasibility and the optimality of 
{fj} to LP (l48l > in lemmas fl3l and fl4l respectively. □ 
Lemma 13: {fj} is feasible with respect to LP (l48l ). 
Proof: Since A = kj^N (1 < j < N — Q) and 
Ai,j = k\,j (N — Q < j < N), by ( |471 i it is not hard to see 
that {fj} meets (with equality) every inequality in LP ( l48l ) 
so long as {fj} meets (with equality) the N inequalities: 

E Ti - VI < j < N - Q, 

i£.Afj t N 

E Ti - VAV - Q < j < N, 

Furthermore, the proof of the latter is straightforward (and 
hence omitted). □ 

Lemma 14: {fj} is optimal with respect to LP (1481) . 
Proof: The dual of LP (l48l > is given by 

maximize A mira +y m n , (49) 

m,n 

— 1; VI + 7 + AV 

m,n 

VI < m < 71 < N, 

(s7V m ,„ > 0 ). 

Similar to the proof of Lemma [4] we construct the set S = 
{S^ 1 ),..., §( < 3+ 1 )} of N subsets N m , n such that {sj^ m „} is 
feasible to LP (|49| >. where sj^ m n = -p, for every Af m , n G S, 
and stf m n = 0, for every Af mtn ^ §. Considering four cases 
(depending on Q and R ), we construct the partitions {S-V}: 

(i) Q odd, R = 0: = 

1 < j < ^ 5 -^, and S^A) = \N 2 j-Q- 2 .N-j +\, 

J^2j-Q-l,N-j+l}, < j < Q + 1- 

(ii) Q odd, R = 1: N' 2 j,N-j+i}, 

1 < j < and = {-Xf‘2j-Q-2.N-j+i , 

< j < Q, and §«2 +1 ) = 

{AVq.at-q}. 

(iii) Q even, R = 0: § w = {AV 2 j-i,jv-j+i, N'i 3 ,N-j+i}, 

1 < 3 < and ^ ( ^ +1) = {AVq +1iJV _q, AV 1jv _q}, 
















( 53 ) 


and —Q— 2 .N—j 5 A/2j-Q-i,AT-j}> ^ + 1 < 

i < Q + 1- 

(iv) Q even, i? = 1: = {A4j-i,JV-j+i, U 2 j,N-j+i}, 

1 < j < %, and §(t+ 1 ) = {A/g + 1 JV _s, AC liJV _a}, 

and SW) = { J^2j-Q-2,N-j, J^2j-Q-l,N-j}, % + 1 < 
J < 0 , and S«W= {N Q , N -q} 

In each case (i)-(iv), it is easy to see {*}, 1 < i < N, 
belongs to P subsets J\f m , n £ S. Thus, 

^ ' Sj\f 7 n,n-^{j^{m,n}} = 1; VI — ^ ^ • 

m,n 

This confirms the feasibility of {s^f m n }. Then, by the duality 
principle, it suffices to show that 

N 

^ ^ Ti = ^ ^ ^m,nSj\f rnrL • 
i=l m,n 


We only give the proof for the case (i) here (and the proofs 
for the other cases are similar). In the case (i), by our choice 
of J2 m ,n ^m,nSM m , n equals to 



By using (147b . (f50b can be written as 


2-2 Q 
P~ 


kl,N 


Q +1 9 N -1 „ 

E + E p fc M> 

i=2 i=N-Q 


which equals to ^i, since in the case (i) R = 0 (by 

assumption) and thus N — Q — 1 = Q + 1 and P = 2 Q. □ 
Lemma 15: {A} is an exact solution to LP (144b and 
LP 0. 

Proof: By a similar argument as in the proof of 
Lemma [ 6 ] it suffices to show the feasibility and optimality 
of {fi} with respect to LP 0 (Lemma fl 6 l > and LP 03} 
(Lemma fT71>. respectively. □ 

Lemma 16: {fi} is feasible with respect to LP 0. 

Proof: We assume K —> oo, and the results of Lemma[5] 
hold w.p. approaching 1, for any e > 0. We need to show 
that {fi} meets the inequalities: 


1 1 

— > > max — 

K i c M K 


n ^,ii> 


,V{W: 1 < |A/}< P}. 


From Lemma [5] for every J\f C [N], |W|= V, it follows 


max 
3 CAT 


n 

V 




Thus, it suffices to show 


< e, Vl < V < P. 

(51) 


Z iy + e, VI <V<P, (52) 

ie M 


for every Jf C [N], 1^1= V. Moreover, 


By combining (|53] > with ( I Kil l and (fl2l ). we can write 

^ E + e, VI < V < P, (54) 

i&M 

for every A f C [ N ], |A/"|= V. From (l52l > and (l54l) . one can 
see that the proof of the lemma is complete so long as 

^ VI < V < P. (55) 

p ^i,p 

Furthermore, i [55l i is a special case of (1321 ). which was 
previously shown in the proof of Lemma [7] □ 

Lemma 17: {fj} is optimal with respect to LP (1441 ). 
Proof: The proof follows the exact same line as in the 
proof of Lemma fl4l (and hence omitted to avoid repetition) 
since: (i) X m ^ n = k rrltn , for every m < N — Q < n (by 
(l47l >). and (ii) m < N — Q < n, for every A f m ,n € S. □ 
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